[Merged by Bors] - Deduplicate block root computation #3590

paulhauner · 2022-09-20T01:45:24Z

Issue Addressed

NA

Proposed Changes

This PR removes duplicated block root computation.

Computing the SignedBeaconBlock::canonical_root has become more expensive since the merge as we need to compute the merke root of each transaction inside an ExecutionPayload.

Computing the root for a mainnet block is taking ~10ms on my i7-8700K CPU @ 3.70GHz (no sha extensions). Given that our median seen-to-imported time for blocks is presently 300-400ms, removing a few duplicated block roots (~30ms) could represent an easy 10% improvement. When we consider that the seen-to-imported times include operations after the block has been placed in the early attester cache, we could expect the 30ms to be more significant WRT our seen-to-attestable times.

Additional Info

NA

michaelsproul · 2022-09-21T03:48:59Z

This is also part of the consensus context on tree-states, maybe we should work together to pull that out and consolidate it with this and #3263?

paulhauner · 2022-09-22T06:58:27Z

This is also part of the consensus context on tree-states, maybe we should work together to pull that out and consolidate it with this and #3263?

Sounds reasonable. This does a fair amount of work in the networking stack to remove duplicate hashes there too, does the consensus context extend down into the networking side of things?

paulhauner · 2022-09-22T08:03:24Z

does the consensus context extend down into the networking side of things?

I can figure that out myself! I'm going to start playing with tree-states now :)

michaelsproul · 2022-09-22T08:43:04Z

As you may have already discovered, it does not do any magic in networking land 😅

Maybe we can merge this PR almost as-is and I can open a follow-up PR with the consensus context. I've been thinking it would be cool to have a bunch of per-slot and per-epoch caches that get attached to the context to accelerate various things. And we could build these caches off the hot path of block verification by doing it for the next slot after the processing of a new block (maybe in state advance)

michaelsproul

I've checked this thoroughly and I'm satisfied that it's correct. Excited to improve our block processing times further 🚀

This optimisation has the nice property that it's quite hard to screw up, because the block makes no claim as to its own block root, so if we have a block root available then it must have been computed from the block. The only risk would be using tree_hash_root() on the signed beacon block rather than canonical_root(), and I've checked that this PR also doesn't do that.

Let's merge!

paulhauner · 2022-09-23T03:52:28Z

bors r+

## Issue Addressed NA ## Proposed Changes This PR removes duplicated block root computation. Computing the `SignedBeaconBlock::canonical_root` has become more expensive since the merge as we need to compute the merke root of each transaction inside an `ExecutionPayload`. Computing the root for [a mainnet block](https://beaconcha.in/slot/4704236) is taking ~10ms on my i7-8700K CPU @ 3.70GHz (no sha extensions). Given that our median seen-to-imported time for blocks is presently 300-400ms, removing a few duplicated block roots (~30ms) could represent an easy 10% improvement. When we consider that the seen-to-imported times include operations *after* the block has been placed in the early attester cache, we could expect the 30ms to be more significant WRT our seen-to-attestable times. ## Additional Info NA

bors · 2022-09-23T05:58:54Z

Pull request successfully merged into unstable.

Build succeeded:

## Issue Addressed NA ## Proposed Changes This PR removes duplicated block root computation. Computing the `SignedBeaconBlock::canonical_root` has become more expensive since the merge as we need to compute the merke root of each transaction inside an `ExecutionPayload`. Computing the root for [a mainnet block](https://beaconcha.in/slot/4704236) is taking ~10ms on my i7-8700K CPU @ 3.70GHz (no sha extensions). Given that our median seen-to-imported time for blocks is presently 300-400ms, removing a few duplicated block roots (~30ms) could represent an easy 10% improvement. When we consider that the seen-to-imported times include operations *after* the block has been placed in the early attester cache, we could expect the 30ms to be more significant WRT our seen-to-attestable times. ## Additional Info NA

paulhauner added 7 commits September 20, 2022 10:01

Avoid root in load_parent

2e153c3

Avoid roots in gossip and sync methods

3a812b0

Remove misc networking roots

f3be7af

Thread block root through RPC block flow

469fbb2

Thread block root into block processing

5039e57

Fix compile errors in tests

35c812c

Maintain metrics for RPC blocks

a5cce2f

paulhauner added the work-in-progress PR is a work-in-progress label Sep 20, 2022

paulhauner added ready-for-review The code is ready for review and removed work-in-progress PR is a work-in-progress labels Sep 23, 2022

michaelsproul marked this pull request as ready for review September 23, 2022 03:25

michaelsproul added the optimization Something to make Lighthouse run more efficiently. label Sep 23, 2022

michaelsproul approved these changes Sep 23, 2022

View reviewed changes

michaelsproul added ready-for-merge This PR is ready to merge. v3.1.2 Release after v3.1.0 (formerly v3.1.1) and removed ready-for-review The code is ready for review labels Sep 23, 2022

bors bot changed the title ~~Deduplicate block root computation~~ [Merged by Bors] - Deduplicate block root computation Sep 23, 2022

bors bot closed this Sep 23, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Merged by Bors] - Deduplicate block root computation #3590

[Merged by Bors] - Deduplicate block root computation #3590

paulhauner commented Sep 20, 2022

michaelsproul commented Sep 21, 2022

paulhauner commented Sep 22, 2022

paulhauner commented Sep 22, 2022

michaelsproul commented Sep 22, 2022

michaelsproul left a comment

paulhauner commented Sep 23, 2022

bors bot commented Sep 23, 2022

[Merged by Bors] - Deduplicate block root computation #3590

[Merged by Bors] - Deduplicate block root computation #3590

Conversation

paulhauner commented Sep 20, 2022

Issue Addressed

Proposed Changes

Additional Info

michaelsproul commented Sep 21, 2022

paulhauner commented Sep 22, 2022

paulhauner commented Sep 22, 2022

michaelsproul commented Sep 22, 2022

michaelsproul left a comment

Choose a reason for hiding this comment

paulhauner commented Sep 23, 2022

bors bot commented Sep 23, 2022